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(57) Abstract 

Hybrid and novel polyketide synthases and polykctidcs are produced by use of a multiple vector system. The combinatorial possibilities 
offered by placing the various catalytic activities of PKS systems on separate vectors permits the construction of improved libraries of PKS 
and polyketides. In addition, polyketides can be produced in hosts that ordinarily do not produce polykctidcs by supplying, along with an 
expression system for the desired PKS, an expression system for holo ACP synthase. 
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PRODUCTION OF POLYKETIDES IN BACTERIA AND.YEAST 



This application claims priority under 35 USC 119 from provisional 
5 application 60/033,193 filed 18 December 1996. The contents of this provisional 
application are incorporated herein by reference. 

Technical Field 

The invention relates to production of polyketides in microbial hosts such as 
1 0 yeast and E. coli and to preparation of libraries containing a variety of functional 
polyketide synthases (PKSs) and the resulting variety of polyketides. More 
specifically, it concerns supplying portions of the polyketide synthase systems on 
separate vectors for simplicity in mixing and matching these portions to create a 
variety of PKS resultants. This permits production of libraries of polyketide 
1 5 syntheses and polyketides through a combinatorial approach rather than manipulation 
focused on a single production system. 

Background Aft 

Polyketides represent a singularly useful group of natural products which are 
2 0 related by their general pathway of biosynthesis. Representative members include the 
macrolide antibiotics, for example, erythromycin, spiramycin and tylosin, 
immunosuppressants such as rapamycin and FK506, antiparasitics such as the 
avermectins, antifungal agents such as amphotericin B and nystatin, anticancer agents 
such as daunorubicin and doxorubicin and anticholesterolemics such as mevinolin. 

2 5 Polyketides generally arc secondary metabolites of the actinomycetes including the 

genera Streptomyces^ Actinomyces, Actinomadura, Micromonospora, 
Saccharopolyspora, and Nocardia. It was estimated that in 1 986 about 6,000 
antibiotics of microbial origin had been characterized of which 70 were in clinical 
use; an additional 1 100 metabolites were reported between 1988 and 1992, 

3 0 approximately 40% of which were polyketides. 

Despite the multiplicity of polyketide structures available from nature, there 
remains a need to expand the repertoire of available polyketides and to synthesize a 
multiplicity of polyketides in the form of libraries so that there is a convenient 
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substrate for screening to identify polyketidcs that arc relevant to a specific target of 
interest. The present invention provides solutions to these needs 

Polyketides generally are synthesized by condensation of two-carbon units in a 
manner analogous to fatty acid synthesis. In general, the synthesis involves a starter 
5 unit and extender units; these "two-carbon*' units are derived from acylthioesters, 
typically acetyl, propionyl, malonyl or methylmalonyl coenzyme-A thioesters. There 
are two major classes of polyketidc synthases (PKSs) which differ in the "manner" in 
which the catalytic sites are used - the so-called "aromatic" PKS and the modular 
PKS. The present invention employs coding sequences from both these classes as 
1 0 will further be explained in the herein application. 

Recombinant production of heterologous functional PKS - i.e., a PKS which 
is capable of producing a polyketide - has been achieved in Sirepiomyces and hybrid 
forms of aromatic PKSs have been produced in these hosts as well See, for example, 
Khosia, C. etal ./fiacrmo/ (1993) 175:2194-2204, Hopwood, D A. etaL Nature 
15 (1985)314:642-644; Sherman, D.H. el ai J fiacteriol (] 992) 174:6184-6190 In 
addition, recombinant production of modular PKS enzymes has been achieved in 
Streptomyces as described in PCT application WO 95/08548 In all of these cases, the 
PKS enzymes have been expressed from a single vector. A single vector which 
carried genes encoding PKS catalytic sites was transformed into K coli by Roberts, 
2 0 G.A., etaL, EurJBiochem (1993) 214:305-31 1, but the PKS was not functional, 
presumably due to lack of pantothenoylation of the acyl carrier proteins. 

The present invention provides double or mukivector systems for production 
of PKS and the resultant polyketides in a variety of hosts. The use of multiple vectors 
provides a means more efficiently to enhance the number of combinatorial forms of 
2 5 PKS and polyketides that can be prepared. Addition of the machinery for 

pantothenoylation of the acyl carrier proteins (i.e., a holo ACP synthase) permits 
production of polyketides in a wide spectrum of hosts. 

Disclosure of the Invention 
^ ^ The invention relates to recombinant materials for the production of 

polyketides in a wide variety of hosts and of libraries of PKS enzymes and the 
resultant polyketides based on a multiple vector system. The use of a multivector 
system facilitates the construaion of combinatorial libraries and permits more 
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flexibility in designing various members thereof The invention alsarelates to such 
libraries which are essentially self-screening due to an autocrine system involving 
polyketide-responsive receptors. 

Thus, in one aspect, the invention relates to a recombinant host cell and 
S libraries thereof when the host cell is modified to contain at least two vectors, a first 
vector containing a first selection marker and a first expression system and the second 
vector containing a second selection marker and a second expression system and 
optionally additional vectors containing additional selectable markers and expression 
systems, wherein the expression systems contained on the vectors encode and are 

1 0 capable of producing at least a minimal PKS system. If the minimal PKS system is an 
aromatic system, the minimal system will comprise a ketosynthase/acyl transferase 
(KS/AT) catalytic region, a chain length factor (CLF) catalytic region and an acyl 
carrier protein (ACP) activity. If the minimal PKS system is a modular system, the 
system will contain at least a KS catalytic region, an AT catalytic region, and an ACP 

1 5 activity. For modular systems^ these activities are sufficient provided intermediates in 
the synthesis are provided as substrates; if de novo synthesis is to be required, a 
loading acyl transferase should be included, which will include another AT and ACP 
region. 

In one specific embodiment of this aspect of the invention, the recombinant 
2 0 host cell will be modified to contain: (a) a first vector comprising a first selectable 
marker and an expression system comprising a nucleotide sequence encoding a 
ketosynthase/acyl transferase (KS/AT) catalytic region of an aromatic PKS operably 
linked to a promoter operable in said cell; (b) a second vector comprising a second 
selectable marker and an expression system comprising a nucleotide sequence 

2 5 encoding a chain length factor (CLF) catalytic domain operably linked to a promoter 

operable in said cell; and (c) a third vector containing a third selectable marker and an 
expression system which comprises a nucleotide sequence encoding an acyl carrier 
protein (ACP) activity operably linked to a promoter operable in said cell, and to 
libraries comprised of colonies of such cells. Alternatively, two of the vectors can be 

3 0 combined so that the host cell contains only two vectors; the vector containing two 

expression systems may maintain these as separate expression systems or two open 
reading frames may be placed under the control of a single promoter 
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In another specific embodiment, the invention relates to a cell-modified to 
contain a first vector containing a first selectable marker and an expression system for 
at least one minimal module of a modular polyketide synthase (PKS) operably linked 
to a promoter operable in said cell, and a second vector containing a second selectable 
5 marker and a nucleotide sequence encoding at least a second minimal module of a 
modular polyketide synthase operably linked to a promoter operable in said cell, and 
to libraries comprising colonies of such cells 

In another variation, one or more expression systems for a defined portion of a 
PKS system is integrated into the host chromosome and at least one additional 
1 0 expression system resides on a replicable vector. Thus, in the case of aromatic PKS, 
an expression system for one of the open reading fi^ames may first be integrated into 
the chromosome and expression systems for other open reading frames may reside on 
vectors. In the case of a modular PKS, an expression system for one or more modules 
may reside on the chromosome and additional expression systems for one or more 
1 5 modules reside on vectors. The integration of such expression systems into the 
chromosome can occur either through known phage-mediated integration or by 
homologous recombination. 

The invention also is directed to novel polyketides produced by the methods of 
the invention and to methods to screen the polyketide libraries obtained. 
2 0 In still another aspect, the invention is directed to methods to obtain the 

synthesis of polyketides in hosts that lack a mechanism for activation of the acyl 
carrier proteins - i.e., which lack holo AC? synthases. By supplying an expression 
system for a compatible holo ACP synthase either on a separate vector, on one of the 
vectors in a multiple vector system (or on a single vector for PKS expression), or as a 
2 5 fusion protein with a PKS or portion thereof, hosts such as E. coli, yeast, and other 
microbial systems which do not customarily synthesize polyketides can be made into 
convenient hosts. This obviates the necessity for supplying "clean" hosts from 
polyketide-producing strains of, for example, Streptomyces. 



30 



Brief Desc ription of the Drawings 

Figure 1 is a diagram showing the composition of several typical aromatic 

PKS. 
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Figure 2 is a diagram showing the organization of erythromycin PKS as 
typical of a modular PKS 

Figure 3 is a diagram showing the organization of the fungal PKS system, 
6-methyl salicylic acid synthase (6-MSAS) 
5 Figure 4 is a diagram which shows the conceptualization of a multivectored 

modular PKS system. 

Figure 5 is a diagram of a multivectored aromatic PKS system. 

Figure 6 shows, diagrammatically, the construction of a vector for expression 
of a holo-ACP synthase and a vector for the expression of the gene encoding 
1 0 6-MSAS, both vectors for use in yeast. 

Figure 7 shows the results of HPLC run on supematants of yeast cultures 
transformed with various vectors of the invention. 

Figures 8A and 8B show the kinetics of production of the antibiotic 6-methyl 
salicylic acid (6-MS A) in yeast (Figure 8A) and in A. coli (Figure 8B). 
1 5 Figure 9 shows the expression systems for two modular PKS for use in vectors 

compatible with yeast along with the expected products. 

Modes of Carrying Out the Invention 

The invention in various aspects employs various components of the aromatic, 
2 0 PKS system, the modular PKS system, a fungal PKS system, or modified forms 
thereof or portions of more than one of these. The general features of aromatic, 
modular and fungal PKS systems are shown in Figures 1, 2 and 3 respectively. 

"Aromatic" PKS systems are characterized by the iterative use of the catalytic 
sites on the several enzymes produced. Thus, in aromatic PKS systems, only one 

2 5 enzyme with a specific type of activity is produced to catalyze the relevant activity for 

the system throughout the synthesis of the polyketide. In aromatic PKS systems, the 
enzymes of the minimal PKS are encoded in three open reading frames (ORFs). As 
shown in Figure I, the actinorhodin PKS is encoded in six separate ORFs. For the 
minimal PKS, one ORF contains a ketosynthasc (KS) and an acyltransferase (AT); a 

3 0 second ORF contains what is believed to be a chain-length factor (CLF); and a third 

reading frame encodes an acyl carrier protein (ACP) Additional ORFs encode an 
aromatase (ARO), a cyclase (CYC), and a ketorcductasc (KR). The combination of a 
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KS/AT, ACP, and CLF constitutes a minimal PKS, since these elements are necessary 
for a single condensation of a two-carbon unit. 

On the other hand, the gris PKS contains five separate ORFs wherein the 
KS/AT, CLF, and ACP are on three ORFs, the KR is on a fourth, and the ARO is on a 

5 fifth. 

In the "modular" PKS systems, each catalytic site is used only once and the 
entire PKS is encoded as a series of "modules " Thus, the modular synthase protein 
contains a muhiplicity of catalytic sites having the same type of catalytic activity A 
minimal module contains at least a KS, an AT and an ACP. Optional additional 

1 0 activities include KR, DH, an cnoylreductasc (ER) and a thioesterase (TE) activity 

Figure 2 shows, diagrammatically, the organization of the modular PKS system for 
the synthesis of the immediate precursor, 6-dEB, for the antibiotic erythromycin. As 
shown, there is a loading region followed by six modules; the thioesterase on module 
6 effects release of the completed 6-deoxyerythronolide B (6-dEB) from the synthase 
15 to which it is coupled through a phosphopantotheinyl group The diagram shows the 
progressive formation of the 6-deB which is cyclized after removal from the holo 
ACP on module 6 of the synthase. To convert 6-deB to erythromycin A, two sugar 
residues arc added in subsequent reactions through the hydroxyl groups at positions 3 
and 5. 

2 0 The "ftingal" PKS encoding 6-methyl salicylic acid synthase (6-MS AS) has 

some similarity to both the aromatic and modular PKS. It has only one reading frame 
for KS, AT, a dehydratase (DH), KR and ACP. Thus, it looks similar to a single 
module of a modular PKS. These sites are, however, used iteratively. Unlike an 
aromatic PKS, it does not include a CLF, as shown in Figure 3. 

2 ^ The invention herein employs expression systems for the catalytic activities 

involved in all of the aromatic, modular and fungal PKS systems The proteins 
produced may contain the native amino acid sequences and thus the substrate 
specificities and activities of the native forms, or altered forms of these proteins may 
be used so long as the desired catalytic activity is maintained. The specificity and 

3 0 efficiency of this activity may, however, differ from that of the native forms. Certain 

activities present in the native system, however, can be intentionally deleted. Further, 
components of various aromatic systems can be mixed and matched, as well as can 
components of various modules of the module systems. PCT application 
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WO 95/08548. referenced above and incorporated herein by reference describes the 
construction of hybrid aromatic PKS systems where, for example, open reading 
frames of actinorhodin are included in expression vectors with open reading frames 
from alternative aromatic systems. 
5 Expression systems for the PKS proteins alone may not be sufficient for actual 

production of polykctidcs unless the recombinant host also contains holo ACP 
synthase activity which effects pantothenoylalion of the acyl carrier protem This 
activation step is necessary for the ability of the ACP to "pick up" the "2C" unit 
which is the starter unit or the growing polyketide chain in the series of Claisen 
1 0 condensations which result in the finished polyketide. For hosts lacking a 
phosphopantothenoylating enzyme that behaves as a holo ACP synthase, the 
invention provides means for conferring this activity by supplying suitable expression 
systems for this enzyme The expression system for the holo ACP synthase may be 
supplied on a vector separate from that carrying a PKS unit or may be supplied on the 

1 5 same vector or may be integrated into the chromosome of the host, or may be supplied 

as an expression system for a fijsion protein with all or a portion of a polyketide 
synthase In general, holo ACP synthases associated with fatty acid synthesis are not 
suiuble; rather, synthases associated specifically with polyketide synthesis or with 
synthesis of nonribosomal proteins are usefijl in this regard. 
20 Specifically, the modular and fungal PKS systems are not activated by 

phosphopantothenoylation effected by the phosphopantothenoylation enzymes 
indigenous to E, coU\ however, enzymes derived from Bacillus^ in particular the 
gramicidin holo ACP synthase of Bacillus brevis and the surfactin-relatcd holo-ACP 
synthase from Bacillm subtilis can utilize the modular and fungal PKS ACP domains 

2 5 as substrates. As shown in the Examples below, while inclusion of an expression 

system for an appropriate holo-ACP synthase is not necessary for just the expression 
of the genes encoding fungal or modular PKS in E. coli or yeast, inclusion of such 
expression systems is required if polyketides are to be produced by the enzymes 
produced. 

It should be noted that in some recombinant hosts, it may also be necessary to 
activate the polyketides produced through postsynthesis modifications when 
polyketides having antibiotic activity are desired. If this is the case for a particular 
host, the host will be modified, for example by transformation, to contain those 
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enzymes necessary for effcaing these modifications Among such enzymes, for 
example, are glycosylation enzymes. 

The combinatorial possibilities for synthesis of aromatic PKS systems depend 
on the nature of the iteratively used sites and the presence or absence of the optional 
activities that are not part of the minimal PKS system required for the Claisen 
condensation which represents the synthetic mechanism for the end-produa 
polyketide. Thus, while the aromatic PK synthase must contain a KS/AT, ACP and 
CLF, the other catalytic activities, i.e. KR, ARO, and CYC are optional. Fungal PK 
synthases require only KS, AT, and ACP functionalities, as do the modular PKS 

1 0 systems. Various combinations of these activities from various sources can be used as 
well as their mutated forms. 

Because the catalytic sites are used only once in the modular PKS systems, the 
combinatorial possibilities in this type of synthase are greater. The combinatorial 
potential of a modular PKS is given by: ATl x (ATk x 4)^ where ATl is the number 

15 of loading acyl transferases, ATe is the number of extender acyl transferases, and M is 
the number of modules in the gene cluster. The number 4 is present in the formula 
because this represents the number of ways a keto group can be modified by either 
1) no reaction; 2) KR activity alone; 3) KR+DH activity; or 4) KR+DH+ER activity. 
It has been shown that expression of only the first two modules of the erythromycin 

2 0 PKS resulted in the production of a predicted truncated triketide product. See Kao, et 
al. J Am Chem Soc (1994) U6]\6\2'\]613. A novel 12-membered macrolide 
similar to methymycin aglycone was produced by expression of modules 1-5 of this 
PKS in S. coclicolor. See Kao, C. etai J Am Chem Soc (1995) 117:9105-9106. This 
work, as well as that of Cortes, J. et aL Science (1995) 268: 1487-1489, shows that 

2 5 PKS modules arc functionally independent so that lactone ring size can be controlled 

by the number of modules present. 

In addition to controlling the number of modules, the modules can be 
genetically modified, for example, by the deletion of a ketoreductase domain as 
described by Donadio, S. et al Science (1991) 252:675-679; Donadio, S. et aL Gene 

3 0 (1992) 115:97-103. In addition, the mutation of an enoyl reductase domain was 

reported by Donadio, S. eiaL Proc Natl Acad Sci USA (1993) 90:7119-7123 These 
modifications also resulted in modified PKS and thus modified polyketides. 
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As stated above, in the present invention, the coding sequences for catalytic 
activities derived from the aromatic, fungal or modular PKS systems found in nature 
can be used in their native forms or modified by standard mutagenesis techniques to 
delete or diminish activity or to introduce an activity into a module in which it was 
5 not originally present. For example, a KR activity an be introduced into a module 
normally lacking that function. 

While the art, as set forth above, has succeeded in producing some novel 
polyketides by virtue of construction of hybrid and/or altered aromatic or modular 
PKS systems in Strcpiomyccs from a single expression vector, advanuge has not been 
1 0 taken of using a multiple vector system in host cells generally in order to produce a 
wider variety of synthases. By "multiple" is meant two or more; by "vector" is meant 
a nucleic acid molecule which can be used to transform host systems and which 
contains both a selectable marker and an independent expression system containing a 
coding sequence under control of a promoter and any other suitable sequences 
1 5 regulating expression. Typical such vectors are plasmids, but other vectors such as 
phagemids, cosmids, viral vectors and the like can be used according to the nature of 
the host. 

Of course, one or more of the separate vectors may result in integration of the 
relevant expression systems into the chromosome of the host. 
2 0 Neither have microbial hosts generally, such as E, coli and yeast, been used 

successfully to construct polyketides. It is believed that this is due to the lack of holo 
ACP synthase which, according to the methods of the invention, can be supplied to 
these hosts. 

Thus, in order to produce the polyketides of the invention, suitable hosts are 
2 5 modified to contain vectors, typically plasmids, which contain expression systems for 
the production of proteins with one or more of the activities associated with PKS. By 
placing various activities on different expression vectors, a high degree of variation 
can be achieved. A variety of hosts can be used; any suitable host cell for which 
selection markers can be devised to assure the incorporation of multiple vectors can 
30 readily be used. Preferred hosts include yeast, K coli, actinomycetes, and plant cells, 
although there is no theoretical reason why mammalian or insect cells or other 
suitable recombinant hosts could not be used. Preferred among yeast strains are 



wo 98/27203 _ j q . PCT/US97/23014 

Saccharomyces cerevisiae and Pichta pasions. Preferred actinomycetes include 
various strains of Strcpiamyces 

The choice of hosts, of course, dictates the choice of the control sequences 
associated with the expression system as well as the selectable markers Suitable 
5 promoter systems, for example, for use in E, coli include the tryptophan (trp) 

promoter, the lactose (lac) promoter, the T7 promoter and the X-derived Pl promoter 
and N-gene ribosome binding site. For yeast, suitable control sequences include 
promoters for the synthesis of glycolytic enzymes, such as 3-phosphoglycerate kinase 
Other promoters include those for alcohol dehydrogenase (ADH-I and ADH-2), 
1 0 isocytochrome-C, acid phosphatase, degradative enzymes associated with nitrogen 
metabolism and enzymes responsible for maltose and galactose utilization. It is also 
believed that terminator sequences are desirable at the 3* end of the coding sequences 

Suitable promoters for use in mammalian cells, actinomycetes, plant cells, 
insect cells and the like are also well known to those in the art. 
1 5 Selectable markers suitable for use in bacteria such as E. coU and 

actinomycetes generally impart antibiotic resistance; those for use in yeast often 
complement nutritional requirements. Selectable markers for use in yeast include, but 
are not restricted to IIRAS, LEin-d TRPl LYS2, HISL HIS3. Selectable markers for 
use in actinomycetes include, but are not restricted to those for thiostrepton-, 
2 0 apramycin- hygromycin-, and erythromycin-resistance 

Methods and materials for construction of vectors, transformation of host cells 
and selection for successful transformants are well understood in the art. 

Thus, according to one embodiment of the invention herein, a single host cell 
will be modified to contain a multiplicity of vectors, each vector contributing a 

2 5 portion of the synthesis of a PKS system. In constructing multiple vectors for 

production of aromatic PKS systems, the separate reading frames such as those shown 
in Figure 1 may be incorporated on separate vectors or, if properly constructed, 
portions of reading frames can be distributed among more than one vector, each with 
appropriate sequences for effecting control of expression For modular systems a 

3 0 single module or more than one module may reside as a part of an expression system 

on a single vector; multiple vectors are used to modify the cell to contain the entire 
desired PKS system. 
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As stated above, one or more of the expression systems introduced into the 
host may be integrated into the chromosome. 

Thus, to prepare the libraries of the invention, suitable host cells are 
transformed with the desired number of vectors; by using different selectable markers 
o on each vector desired as part of the modification, successful transformants which are 
modified by inclusion of all the desired vectors can be selected By using mixtures of 
a first vector with a first selectable marker containing a multiplicity of expression 
systems for a portion of a PKS synthase, and a mixture of a second vector with 
expression systems for a variety of a second portion of a PKS system, and so forth, 
1 0 colonies of successful transformants are obtained that have a combinatorial 

representation of "hybrid" PKS systems. By preparing panels of individual colonies 
of such successful transformants, a library of PKS systems is obtained and thereby a 
library of polyketides. An expression system for holo ACP synthase is also supplied 
if needed. The polyketides may be glycosylated depending on the nature of the host. 

1 ^ This approach can also be modified by effecting the integration of the 

appropriate portion of one or more of the multiple vectors into the chromosome of the 
host. Integration can be eflFected using suitable phage vectors or by homologous 
recombination. If homologous recombination is used, the integration may also delete 
endogenous PKS activity ordinarily residing in the chromosome, as described in the 

2 0 above-cited PCT application WO 95/08548. In these embodiments, too, a selectable 

marker such as hygromycin or thiostrepton resistance will be included in the vector 
which effects integration 

The libraries of polyketides can then be screened for activity with respect to 
any polyketide responsive target in order to identify particular polyketide members 
2 5 that will activate or otherwise bind to the target. Such screening methods are standard 
in the art. 

In a particularly preferred embodiment of the invention, the library can be 
made self-screening by introducing a polykctide-responsivc receptor that is 
intracellular to or is displayed at the surface of the host cell producing the polyketide 
30 itself This "autocrine" system allows the colonies to self-select for those activating 
the receptor. Such systems are described, for example, in an article by Broach, J.R. 
and Thomer, J., Nature (1996) 384:Supp 7:14-16 
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Autocrine systems need not be limited, however, to receptors,.but can include 
proteins that are expressed internal to the cell and whose interaction can be evaluated 
with respect to the polyketides produced, in a manner analogous to the yeast 2 hybrid 
system described by Fields in U S Patent 5,283,173. 
5 Thus, the cells are modified to create ^'cell-based detection systems for 

poiyketide function." The function of the polyketide may include agonist or 
antagonist activity with respect to a receptor which is either produced at the surface of 
the cell or produced mtracellularly, or the polyketides may be agonists or antagonists 
for two hybrid interaction screens so that it will be possible to select for protein- 
1 0 protein interaction inhibitors or cross-linking factors analogous to rapamycin and 
FK506. 

It should be noted, that such cell-based detection systems are also useful in 
screening libraries of polyketides which are produced fi-om cells containing only 
single vector systems Thus, these improvements are applicable not only to the 

1 5 multivector combinatorial libraries of the present invention but also to polyketide 
synthase and polyketide libraries produced using cells containing these systems on a 
single expression vector. 

As mentioned above, additional enzymes which effect post translational 
modifications to the enzyme systems in the PKS may need to be introduced into the 

2 0 host through suitable recombinant expression systems. In addition, enzymes that 
activate the polyketides themselves, for example, through glycosylation may be 
needed. It may also be necessary to modify the catalytic domains to alter their 
substrate specificity or to substitute domains with the ^propriate specificity. For 
example, it is generally believed that malonyl CoA levels in yeast are higher than 

2 5 methyl malonyl CoA; if yeast is chosen as a host, it may be desirable to include 

catalytic domains that can utilize malonyl CoA as an extender unit, such as those 
derived from spiramycin or tylosin. 

Figure 4 diagrams one embodiment of the conceptual basis of the present 
invention wherein three separate vectors are employed to produce a modular PKS. As 

3 G shown, each vector permits the construction of 64 different open reading frames using 

two extender ATs (one from methylmalonyl CoA and the other fi-om malonyl CoA) 
and the four combinations involving KR, DH, and ER as described above Thus, 
module No. 1 may employ malonyl CoA as an extender unit; module No. 2 
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mcthylmalonyl CoA; the opposite sequence can be used, or both ext^ders might use 
malonyl CoA or both might use mcthylmalonyl CoA. This results in four separate 
types of extender combinations, each of which is multiplied by the four KR/DH/ER 
variants Each separate plasmid offers the same set of possibilities, one of the 
5 plasmids must also contain a loading function and one must contain a thioesterase 
function. Thus, by construction of 192 plasmids, the upper limit of synthesis of novel 
polyketides is 64x64x64 or 262,144 molecules, providing an efficient method to 
obtain large numbers of novel polyketides. 

Figure 5 shows an approach to a multiple vector aromatic PKS that is set forth 
in greater detail in Example 1 1 hereinbelow In Figure 5, the three separate reading 
frames of a typical aromatic polyketide synthase are placed on separate vectors. 
Thus, each reading fi^ame can be derived from a different aromatic polyketide 
synthase if desired. 

Another modification useful in varying the polyketides produced regardless of 
1 5 the host cell employed manipulates the PKS, in particular a modular or fungal PKS, to 
inactivate the ketosynthase (KS) on the first module. This permits enhanced 
efficiency m permitting the system to incorporate a suitable diketide thioester such as 
3-hydroxy-2-methyl pantonoic acid-N-acetyl cysteamine thioester, or similar 
thioesters of diketide analogs, as described by Jacobsen ei al Science (1 997) 277:367- 
369. The construction of PKS modules containing inactivated ketosynthase regions is 
described m copending U.S. application 08/675,817 and published in PCT application 
WO97/02358 incorporated herein by reference. These modified PKS modules can be 
employed in the various embodiments of the invention in preparing libraries using 
multivector methods and/or in £. coU and yeast-based production organisms for the 
25 polyketides which may require the additional expression of a gene encoding a suitable 
holo-ACP synthase. 

Thus, the present invention provides the opportunity to produce polyketides in 
hosts which normally do not produce them, such as E. coii and yeast. The invention 
also provides more efficient means to provide a variety of polyketide products by 
3 0 supplying the elements of the introduced PKS, whether in an E, coli or yeast host or in 
other more traditionally used hosts, on multiple separate vectors. The invention also 
includes libraries of polyketides prepared using the methods of the invention. 
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Uses of Polyketides 

As is well understood, the polyketides, in their glycosylated forms, are 
powerful antibiotics. In addition, many polyketides are immunosuppressants and 
anticancer agents It has also been found that polyketides or their glycosylated forms 
5 can reduce inflammation under certain circumstances. This is believed to be due to 
the ability of certain antibiotics to inhibit the release of cytokines such as IL-8 For 
example, Hott, M. in the Kunjme MedicalJournal (] 996) 43:207-217 concludes that 
the favorable clinical effect of erythromycin in cryptogenic organizing pneumonia and 
related conditions is due to inhibition of neutrophil accumulation in the peripheral 
1 0 airways through local suppression of IL-8 production. In further experimental work, 
Tamaoki, J. et al. Antimicrobial Agents and Chemotherapy ( 1 996) 40; 1 726- 1 728 
showed that pretreatment of guinea pigs with roxithromycin or erythromycin inhibited 
the increase in goblet cell secretion when IL-8 was inhaled. Hamada, K. et al. 
Chemotherapy (1995) 41:59-69 showed that the antitumor effect of erythromycin in 
1 5 mice was due to enhancing the production of IL-4. In another study, Keicho, N. et al., 
Journal of Antibiotics (Tokyo) (1993) 46 1406-1413, state that erythromycin has been 
reported to depress the extent of inflammation independent of its antimicrobial action 
and show that erythromycin suppresses the proliferative response of human 
lymphocytes stimulated with mitogens and antigens but had no effect on 
2 0 concanavilin-A induced IL-2 produaion or lL-2R-a expression. Bailly, S. et al. 
A ntimicrobial Agents and Chemotherapy ( 1 99 1 ) 35 ; 201 6-20 1 9 showed that 
roxithromycin, spiramycin and erythromycin have differing effects on production of 
IL-la, IL-1 3 and IL-6 as well as tumor necrosis factor a . Spiramycin, and to a lesser 
extent, erythromycin increase total IL-6 production without affecting IL-la, IL-lp or 
2 5 TNFa. Roxithromycin had no effect. 

Thus, there are a number of papers which indicate that antibiotics are also 
important in modulating inflammatory mechanisms. The literature appears to show 
that erythromycin diminishes the produaion of IL-8, but enhances the production of 
IL-6, IL-1 and IL-2. Spiramycin has been shown to enhance the production of IL-6. 

30 

These examples are intended to illustrate but not to limit the invention. 
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Example 1 

Construction of ]02d. a 6-MSAS Yeast Expression Vector 
Control sequences effective in yeast were obtained and inserted into plasmid 
pBlueScript (Stratagene) along with a polylinker. The S. cerevesiae ADH2 promoter 
5 was amplified by PCR using the following primers: 

forward: GGGAGCTCGGATCCATTTAGCGGCCGCAAAACGTAGGGGC 
reverse: 

CCGAATTCTAGAGGTTTCATATGGTATTACGATATAGTTAATAG 

The forward primer contains 15 bases complementary to the 5' ADH2 
1 0 sequence and introduces Sad (nucleotides 3-8), BamHI (nucleotides 9- 1 4), and NoU 
(nucleotides 20-27) restriction sites The reverse primer contains 15 bases 
complementary to the 3' ADH2 sequence and introduces AUe/ (nucleotides 18-23), 
Xhal (nucleotides 7-12), and £cc»/?/ (nucleotides 3-8) sites. 

The ADH2 terminator was amplified by PCR using the following primers 
15 forward: 

GGGAATTCATAGTCGACCGGACCGATGCCTTCACGATTTATAG 
reverse: 

TTTTCTATTATAAGATGAAAAACGAGGGGAGCTCCCATGGCC. 

The forward primer introduces £co;?/ (nucleotides 3-8), Sail (nucleotides 12- 
2 0 17), and Rsrll (nucleotides 1 7-24) restrictions sites. The reverse primer introduces 
Xhol (nucleotides 29-34) and Asp? 18 (nucleotides 35-40) restriction sites. 

The SacI/EcoRI fragment containing the ADH2 promoter, the EcoR]IAsp718 
fragment containing the ADH2 terminator, and the SacI/Asp7J8 fragment of 
pBlueScript were ligated to produce an intermediate vector, 43d2 which contains 

2 5 cloning sites (L2) for 6MS AS and the gene for the surfactant phosphopantothein 

transferase from B. subiilis (the sfp gene). See Figure 6. It also contains sites (LI, 
L3) for transferring the promoter/terminator cassette into yeast shuttle vectors as well 
as sites (LI, L2) for moving the promoter/gene cassettes from the intermediate 
BlueScript vector into the yeast shuttle vector. 

3 0 The ADH2 promoter/terminator was then introduced into the £. co//7yeast 

shuttle vector pYT (a gift from Dr S. Hawkes, University of California, San 
Francisco) The 13 2-kbp RamHIISall restriction fragment from pYT was ligated to 
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the 757-bp BamHIIXhoI restriction fragment from 43d2 to yield plasmid 101c, which 
contains Leu and Ura markers for selection 

To complete construction of the expression vector, a 5 3-kbp NdcIIXhal 
restriction fragment containing the gene for 6-methylsalicylic acid synthase (6- 
5 MS AS) from Pemcillium paiulum was obtained from demethylated plasmid pDB 102 
(Bedford, D., et ai, J Bacteriology (1995) 177:4544-4548) and iigated into 
//c/e//A^a/-restricted 43d2, yielding intermediate plasmid 71 d The 6 1-kbp 
NoillRsrII restriction fragment from 71d was Iigated to the 12 6-kbp NotlfRsrII 
restriction fragment from 101c to produce the expression vector 102d 

10 

Example 2 

Expression of 6-MSAS in Saccharomyces cerevesiae 
Competent Saccharomyces cerevesiae InvScl (MATa his3Dl leu2 trpl-289 
ura3-52) (Invitrogen) was transformed with 102d, then plated on minimal agar plates 

15 (17 g/L yeast nitrogen base without amino acids or ammonium sulfate (DIFCO), 
5 g/L CNH4)2S04, 20 g/L glucose, 20 g/L agar containing amino acids for selection 
based on uracil prototrophy. Transformants were picked and grown for 24 hours in 
uracil-deficient minimal medium. Plasmid DNA was isolated from the transformants 
and analyzed by restriction digestion analysis to confirm identity. 

2 0 A successful transformant was used to inoculate 2 mL of uracil-deficicnt 

minimal medium and was grown overnight at 30°C on an orbital shaker A 100-uL 
aliquot of this culture was used to inoculate 10 mL of YPD medium (Wobbe, C.R., in 
Current Protocols iti Molecular Biology^ Supplement 34:13.0. 1-13.13.9 (Wiley, 
1996)) (10 g/L yeast extract, 20 g/L peptone, 20 g/L glucose), and the culture was 

2 .S grown at lO'^C on a shaker. 

Cells were collected by centrifugation of 500 uL-aliquots of the culture taken 
after 18 and 36 hours of growth and lysed by boiling in 50 uL of 2x SDS gel loading 
buffer for 2 minutes. 

The cell lysates were analyzed by loading onto 12% SDS-PAGE gels. A band 

3 0 corresponding to the expected size of 6-MSAS was observed at ca. 1 90 kD. 
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Example 3 

Construction of a Holo ACP Synthase Expression Vector 
The Bacillus suhtdis sfp gene encodes a holo ACP synthase, i e , a 
phosphopantothenoyl transferase, and is inserted into plasmid YepFLAG-1 
5 (IBI/Kodak) 

The 5.7-kbp PacIINotI restriction fragment of YepFLAG-1 was hgated with a 
synthetic polylinker to introduce the following restriction sites: 
{Pad) - BamHI - Noil - Ncol - Rsrll - Xhol - SaU - (Noil) 
The original Pad and Noil ligation sites were destroyed in the hgation The 
1 0 resulting vector was cut with BamHI and Sal! and was ligated to BamHIIXhoI- 
digested 43d2 (see Example 1) to introduce the ADH2 promoter/terminator, thus 
obtaining the plasmid 126b. The Bacillus subtilis sfp gene was amplified from the 
plasmid pUC8-sfp (Nakano, M. et ai Mol Gen Genet (1992) 232:3 1 3-321 ) by PCR 
using the primers: 

15 forward: TAGACACATATGAAGATTTACGGAATTTATATG 

reverse: TACATTCTAGAAATTATAAAAGCTCTTCG. 

The forward primer introduces a AUe/restriaion site (nucleotides 7-12) and 
the reverse primer introduces an Xbal site (nucleotides 6-11). 

The resulting PCR fragment was ligated into the Ndel and Xhal sites of 43d2 
20 to produce plasmid 1 09c. 

The 1 .3-kbp BamHI/Sall restriction fragment of 109c was ligated to 
BamHIISalhdxg^sXtd 126b to produce expression vector 128a which contains the sfp 
gene under control of the ADH sequences and tryptophan prototrophy as selection 
marker 

25 

Example 4 

Production of 6-methvlsalicvlic Acid in Yeast 
Competent Saccharomyces cerevesiae InvScl cells were transformed with 
102d (6 MS AS) and 128a (sfp holo ACP synthase). 128a was used in the first 
3 0 transformation with selection for tryptophan prototrophy; a successfiil transformed 
was then transfcctcd with 102d, v/ith selection for tryptophan and uracil prototrophy 
Transformants appeared after 48-72 hr at 30'*C 
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Single colonies of the 6 MSAS/sfp transformants were grown.24-48 hr at 30°C 
in tryptophan- and uracil-deficient minimal medium, after which 100 was used to 
inoculate 10 ml of YPD medium. Cultures were grown for 18 hr at 30°C in an orbital 
shaker at 225 rpm. YPD medium (50 ml) was inoculated with 0,5 ml of the overnight 
5 cultures and incubated at 30°C for 142 hr One ml aliquots were removed 

periodically and the cells were collected by centrifiigation. The cells were suspended 
in SDS-PAGE loading buffer, boiled for 2 min and subjected to SDS-PAGE to 
determine the production of the PKS protein. The supematants were analyzed for 6- 
methylsalicylic acid production by injection of 20 uL onto an HPLC (CIS reverse- 

1 0 phase column, watcr/acetonitrile/acetic acid gradient, diode-array UV detection). The 
LC parameters were as follows: Solvent A = 1% acetic acid in water; Solvent B = 1% 
acetic acid in acetonitrile; gradient = 20% B to 80% B in 30 min then to 100% in 2 
min; flow rate = 0.5 ml/min. The amount of 6-methylsalicylic acid was quantitated by 
peak integration at 307 nm. A standard curve was generated using authentic 6- 

15 methylsalicylic acid (Seidel, J.L , et al,, J Chem Ecaiogy (1990) \6M9\-m6) 
The results of a typical experiment are shown in Figure 7. Yeast which 
contained only the control plasmid 101c or control plasmid and the sfp expression 
plasmid 128a produced no 6-MSA (trace b, d). Yeast containing only the 6-MSAS 
expression vector 102d produced a barely detectable amount of 6-MSA (trace c). 

2 0 Yeast containing both the 6-MSAS expression vector 102d and the sfp expression 

vector 128a produced as much as 1 .7 g/1 of 6-MSA (trace a). 

The kinetics for yeast growth and 6-MSA production for the transformant are 
shown in Figure 8A. As shovm, the open squares represent growth as measured by 
ODfeoo The closed circles represent the production of 6-MSA in g/L. The production 
25 of 6-MSA begins when glucose is depleted consistent, with derepression of the ADH2 
promoter A plateau was reached after about 60 hr of growth and remained constant 
up to 150 hr. 

For large-scale preparation of 6-MSA, a 500 ml yeast culture harboring the 
two plasmids was grown for 120 hr and the cells were removed by centrifiigation 

3 0 The supernatant broth (280 ml) was acidified with 28 ml glacial HO Ac, then extracted 

with 280 mi ethyl acetate. The organic extract was concentrated to dryness under 
reduced pressure. The crude product was purified by crystallization from water and 
the crystals were dried under vacuum over KOH The identity of 6-MSA was 
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confirmed by NMR and mass spec. In the specific experiment described above, the 
280 ml of cell-free yeast culture yielded 240 mg of 6-MSA as crystalline needles 
Shake flask cultures typically produced over I g/L of 6-methylsalicylic acid 



^ Example 5 

Construction of t he DEBS Module 6 KR-ACP-TE Expression Vector Plasmid 104 
The plasmid, 90, which contains a T5 promoter, 2 lac operators, and iac*"^ |?] 
was constructed by ligating a 1 1-kbp XhoIIXbal fragment of pQE60 (Qiagen) to the 
larger XhoIIXba] fragment of pET22b(+) (Novagen). A PstllEcoR] restriction 
1 0 fragment containing the DNA encoding module 6 KR-ACP-TE was ligated into 
plasmid 90 to give plasmid 104, an expression vector for this module. 



Example 6 

Phosphopantothenovlation of Module 6 KR-ACP-TE 
15 A. In vivo . 

The p-alanine auxotroph Escherichia coli SJ16 coli Genetic Stock Center), 
was cotransformed with 104 and a holo-ACP synthase expression plasmid containing 
genes for either: 

E. coli fatty acid synthase holo-ACPS (ACPS); 
2 0 E. coli enterobactin synthetase holo-ACPS (EntD), or 

Bacillus brevis gramicidin synthetase holo-ACPS (GsP). 
Holo-ACPS expression plasmids were generous gifts of Dr. Daniel Santi, 
UCSF (Ku, J., et aL, Chemistry & Biology (1997) 4:203-207). 

Each cotransformant was grown in minimal medium E (Vogel, H.J. et ai, J 
25 Biol Chem (1956) 218:97-106) supplemented with 0.001% thiamine, 0.01% 
methionine, and 100 uM P-alaninc at 37°C for 20 h. Cells were collected by 
centrifiigation and washed with 1 mL of growth medium without p-aianine. This 
wash was repeated four times Finally, the cells were incubated in 1 mL of growth 
medium without P-alanine at 37*'C for 6 h. 
^ Q A 30-uL aliquot of the starved cells was added to 1 mL of growth medium 

supplemented with 0.52 uM [3H]-|3-aIanine (1 uCi, American Radiolabeled 
Chemicals, Inc.). After 6 h at 37X, the cells were induced by addition of IPTG to 1 
mM, kept for an additional 3 h at 37X, and centrifuged. The cell pellet was boiled in 
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SDS gel loading buffer, then analyzed on a 10% SDS-PAGE gel The gel was stained 
with Coomassie Blue, photographed, soaked in Amplify (Amersham), dried, and 
autoradiographed using Kodak Bio-MAX film for 2 days 

The module 6 KR-ACP-TE fragment of DEBS was efficiently labeled upon 
5 coexpression with GsP and with EntD, while no labeling was observed upon 

coexpression with ACPS The inability of ACPS to activate the DEBS fragment is 
expected based on the known inactivity and lack of phosphopentothenoylation of the 
DEBS protein when expressed in£". coli (Roberts etaL Eur J Biochem (1993) 
214:305-31 1). 

1 ^ B fn vitro The module 6 KR-ACP-TE fragment of DEBS was purified 

from E. coll transformed with pi 04 using a Ni^^ affinity column following 
manufacturer's directions (Invitrogcn). Purified surfactin synthetase holo-ACPS (sfp) 
from Bacillus svbtil/s was a gift of Dr. Christopher Walsh (Harvard Medical School). 
Labeled SH-coenzyme A was a gift of Dr. Daniel Santi (UCSF). 

1 ^ All assays were performed in 10 mM MgCU. 50 mM Tris-HCl (pH 8 8), in a 

total volume of 100 uL, and contained 40,000 cpm of 3H-coenzyme A and 0.39 uM 
sfp A positive control contained 1.8 uM PhcAT domain from gramicidin synthetase 
(Dr. Daniel Santi, UCSF) which is normally pantothenoylated by sfp Reactions were 
kept 12 h at 37°C, then boiled in SDS gel loading buffer and analyzed on a 10% SDS- 

2 0 PAGE gel. The gel was suined with Coomassie Blue, photographed, soaked in 

Amplify (Amersham), dried, and autoradiographed using Kodak Bio-MAX film for 2 
days. 

Both PheAT and the module 6 KR-ACP-TE fragment of DEBS were 
efficiently labeled by sfp. 

25 

Example 7 

Produaion of 6-methvlsalicvlic acid in Escherichia coli 
The plasmid 90 (see Example 5) was converted to p95 by inserting a linker 
between the EcoRI/Hmdlll in plasmid 90 so as to introduce restriction sites Ndel and 

3 0 Spel adjacent to the T5 promoter. The 6-MSAS expression vector, 109, was 

constructed by ligating a NdellXbal fragment containing the 6-MSAS open reading 
frame (Pfeifer, E ct ai Biochemistry (1995) 34:7450-7459) with the large NdellSpel 
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fragment of 95 leaving about I kbp of the linker between the Spd and Hindi I J sites of 
the vector 

The sfp expression vector, J 08, was made by ligating a 1 1-kbp EcoRIIPvulI 
restriction fragment of pUC8-sfp (see Example 3) to pACYC-184 (New England 
5 Bioiabs) cut with EcoRV difiev fill-in of the EcoRI site by DNA polymerase I The 
orientation of the sfp gene with respect to the promoter was verified by Hindlll 
digestion 

Plasmids 108 and 109 were cotransformed into E. coli C2453, and 
transfomiants were selected by chloramphenicol and ampicillin resistance. A single 

1 0 colony containing both plasmids was grown in ATCC medium 765 supplemented 

with 1 0% glycerol at 37°C to a density of 1 0 ODgoo then cooled to 30*^0 and induced 
by addition of 0.5 mM IPTG. Cell growth was continued for 36 hr at 30 X Protein 
expression was checked by 10% SDS-polyacrylamide gel. The formation of 6- 
methylsalicylic acid was followed by HPLC analysis of the culture broth 

^ ^ The concentration of 6-MSA was estimated as described in Example 4 from a 

plot of concentration vs integrated are a of corresponding HPLC peak using an 
authentic sample. The identity of the product was confirmed by LC-mass 
spectroscopy, which revealed [M+H]+ = 153, with a major fragment at m/z = 135 
corresponding to loss of HjO. Under these conditions, the culture produced 50 mg/L 
2 0 of 6-methylsalicylic acid. 

The production of 6-MSA in jE". coli was dependent on the presence of the 
plasmid encoding the sfp protein £. coli transformed with only the 6-MSAS 
expression vector, 109, when induced by IPTG followed by incubation at 37°C for 
4 hr, showed produaion of the approximately 190 kD 6-MSAS at about 5% of total 

2 5 protein. However, most of the protein was insoluble and 6-MSA was not detected in 

the medium. When the (i-alanine auxotroph E. coli SJ16 containing the 6-MSAS 
expression vector 109 was incubated v/ith labeled p-alanine before and after 
induction, no radioactivity was found in the 6-MSAS band on SDS-PAGE; thus, it 
appears the 6-MSAS was not modified with the phosphopantothcinyl cofactor by 

3 0 endogenous transferase. In a similar experiment involving E. coli SJ 1 6 cotransformed 

with both plasmid 108 and 109, a detectable amount of radioactivity was found in the 
190 kD 6-MSAS band; however, no 6-MSA was detected under these conditions 
However, when the temperature of incubation was lowered to promote proper protein 
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folding and glycerol was added to the medium to increase levels of intracellular 
malonyl CoA substrate, production of 6-MSA was improved. Thus, when cells were 
grown at 30^C in the absence of glycerol or at 37X in the presence of 10% glycerol, 
no 6-MSA was produced. However, when grown as described above at 30°C In the 
^ presence of 10% glycerol, 6-MSA was produced up to about 75 mg/L after 24 hr of 
incubation. The kinetics of production are shown in Figure 8B. 

Example 8 

Production of 6-mcthvlsalicvlic acid in Saccharomyces cerevesiae 
10 using a PKS-holo ACP synthase fusion protein 

A fusion protein between the PemciUium paiulum 6-methylsaIicylic acid 
synthase (6-MSAS) and the Bacillus subtilis surfactin holo ACP synthase (sfp) was 
made as follows; 

A 5.3-kbp NdellHindlll fragment containing the 6-MSAS gene (see Example 
15 1 ) was ligated with a 708-bp HindllllXbal fragment containing the sfp gene (see 
Example 3) and with NdcIIXbal-vtslncXtA 43d2 (see Example 1) to produce 
intermediate plasmid 69. A ca. 6-kbp NotllRsrII restriction fragment from 69 was 
ligated with Notl/RsrII-rtstnciGd 101c (see Example 1) to yield the yeast expression 
vector 26a 1 (see Example 1). This vector contains the 6-MSAS/sfp fusion gene 
2 0 between the ADH2 promoter/terminator pair. 

The resulting fusion protein consisted of connecting the C-terminal lysine of 
6-MSAS with the N-terminal methionine of sfp using an (alanine)3 linker, such that 
the DNA sequence of the gene in the region of the fusion was: 

5'-AAGCTTGCCAAA-GCCGCCGCC-AIQAAGATTTAC-3' 

2 5 where the lysine and methionine codons are underlined. 

Transformation ofS. cerevesiae InvScl with 26al and culturing as described 
in Example 3 resulted in produaion of 6-mcthylsalicyIic acid at a level comparable 
with that resulting from expression of 6-MSAS and sfp as separate genes. The fusion 
protein thus combines the enzymatic activities of 6-MSAS and of sfp, self 

3 0 phosphopantothcnoylates, and produces polyketidc product. 

This is especially useful for transformation of hosts where the number of 
plasmid replicons useable for expression vectors is limited, where polycistronic 
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messages are not properly processed, or where transformation with myltiple veaors is 
difficult and/or time-consuming. 

Example 9 

5 Production of 6-deoxvervthronolidc B bv mixed chromosomal/plasmid expression 
systems in Streptomvces lividans using chromosomal integration 
To demonstrate the feasibility of dividing the three DEBS genes between 
chromosomal and plasmid expression systems, two experiments were performed In 
both experiments, the integrating vector pSET152 (Bierman, M., ciaL, Gene (1992) 

1 t^J JJ_6;43-49) was used to place one gene of the DEBS gene cluster under control of the 

actinorhodin promoter onto the Streptomyccs chromosome at the phage attachment 
site. The remaining genes were placed onto the replicating plasmid, pRM5 
(McDaniel ct aL, Science (1993) 262:1546-1550), also under control of the 
actinorhodin promoter. 
15 A The eryAIII gene (encoding modules 5 and 6 and the thioesterase of 

DEBS) under control of the actinorhodin promoter was cloned into pSET152. The 
resulting vector was used to transform S. lividans K4-1 14, a strain in which the 
actinorhodin gene has been deleted by homologous recombination by standard 
methods (US patent application 08/238,81 1 incorporated herein by reference). 

2 0 Apramycin-resistant transformants were selected. 

An expression plasmid was constructed by cloning the eryAI and eryAH genes 
(containing modules 1+2 and 3+4, respectively) into the PacI/EcoRI sites of pRM5 so 
that the two genes were under the control of the actinorhodin promoter This plasmid 
was used to transform protoplasts of the S. lividans clone contaming the integrated 

2 5 eryAIII gene, and colonics resistant to both thiostrepton and ap-amycin were selected 

B. Alternatively, the actinorhodin promoter and the eryAI gene were 
cloned into pSET152 and subsequently integrated into the lividans chromosome 
The eryAn and eryAIII genes were cloned into pRM5 behind the actinorhodin 
promoter, and this plasmid was used to transform the *S'. lividans strain containing the 

3 0 integrated eryAI gene. 

Randomly selected colonies of the above organisms containing mixed 
chromosomal-plasmid expression systems were cultured on R2YE medium over 
XAD-16 resin, and ethanol extracts of the resin collected after 7 days were analyzed 
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for production of 6-deoxyerythronolide B by LC/mass spectrometry . Cultures from 
both experiments A and B produced 6-deoxyerythronolide B at levels of 15-20 mg/L. 
comparable to that found in extracts of cultures of S, lividans contaimng pCK7, a 
replicating plasmid containing all three eryA genes under control of the actinorhodin 
5 promoter 

Example 10 

Production of 6-deoxvervthronolidc B bv mixed chromosomal/plasmid expression 
systems in Streptomvces Uvidans 

10 An alternative method for constructing a mixed chromosomal-plasmid 

expression system for multi-gene PKSs also achieves simultaneous creation of a clean 
host for polyketide production. A suitable expression host, which normally produces 
a polyketide product, has its chromosomal PKS genes replaced by a subset of the 
foreign PKS genes through homologous recombination. This accomplishes the 

1 5 desired chromosomal integration of the foreign PKS genes while simultaneously 
eliminating interference from and competition by the native PKS. The example is 
readily illustrated for 5. coelicolor and S. lividans, both of which make the blue 
polyketide actinorhodin. 

A method by which the entire actinorhodin gene cluster is removed from these 

2 0 organisms and replaced with an antibiotic marker through homologous recombination 
has been described (US patent application 08/238,81 1). This method is adapted as 
follows: The recombination vector consists of any vector capable of generating 
single-stranded DNA (e.g., pBlueScript) containing the following elements: 1) a DNA 
sequence homologous to the 5* 1-kbp end of the act cluster; 2) a resistance marker 

2 5 (e.g., hygromycin or thiostrepton); 3) the act n-orf4 activator gene; 4) the act 

promoter; 5) one or more genes of the foreign PKS; and 6) a DNA sequence 
homologous to the 3' 1-kbp end of the act cluster. Transformation ofS. coelicolor or 
.V. lividans with the recombination vector followed by selection for hygromycin 
resistance and screening for loss of blue color provides a host lacking the actinorhodin 

3 0 gene cluster and containing a chromosomal copy of the foreign PKS genes along with 

the needed actinorhodin control elements. This host is subsequently transformed by 
replicating vectors (e.g., SCP2*-based plasmids) and/or with integrating phage 
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vectors (e.g., pSET152) containing other genes of the foreign PKS to.complete the set 
of PKS genes and produce polykelide product. 

Example 1 1 

^ Construction of Yeast Vectors for Expression of an Aromatic Minimal PKS 

The genes encoding the KS/AT bifunctional protein and the CLF gene of the 
actinorhodin PKS (diagrammed in Figure 5) are amphfied and tailored by PGR and 
cloned into the yeast expression vector pYEUra3 (Clontech) under control of the Gall 
and GallO promoters respectively The ACP gene is amplified and cloned together 

1 0 with the holo-ACP synthase gene, if necessary, into a plasmid derived from pYEUra3 
by replacement of the Ura3 gene with the Leu2-d gene Expression is also driven by 
the Gall and GallO promoters respectively. Yeast strain BJ2168 is cotransformed 
with these plasmids and also with plasmid 128a (see Example 3) and transformants 
selected on a uracil- and leucine-deficient plates by standard methods. Expression is 

1 5 induced by growth in 2% galactose according to the manufacturer's instructions. The 
polyketide produced by this synthase system is predicted to be 

H3C OH 



HO J 

J . 1. 
HO^ ^ 0 



Example 12 

Construction of Yeast Vectors for Expression of Modular Synthase Activities 
Two vectors are constructed. One contains the putative two-module system of 
spiramycin under control of the ADH-2 promoter and colinear with the thioesterase 
domain of the erythromycin PKS, The coding sequence construct is engineered to be 
flanked by an Ndel site at the initiation codon and an Nsil site following the 
termination codon; this construct is cloned using synthetic oligonucleotide linkers into 
pYT. 
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In the second vector, the analogous structure from the erythrojnycin PKS 
system flanked by Ndel and Nsil sites as described by Kao, C et ai J Am Chem Soc 
(1995) 117:9105-9106 is cloned into pYT so as to be placed under control of the 
ADH-2 promoter. Figure 9 shows the relevant expression portion of these vectors and 
5 the expected polyketidc products. 
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Claims 

1. A modified recombinant host cell, which, in unmodified form, does not 
produce polyketides, which cell is modified to contain an expression system for a 
5 minimal polyketide synthase (PKS) and an expression system for a holo ACP 
synthase, 

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
protein (ACP) activity for an aromatic PKS; or 

1 0 said minimal PKS comprising a KS catalytic region, an AT catalytic region, 

and an ACP activity for a modular PKS or a fungal PKS. 

2. The modified cell of claim 1 which is £. coli or yeast. 

15 3 The modified cell of claim 1 wherein said PKS is the synthase for 6- 

methyl salicylic acid. 

4. The modified cell of claim 1 wherein the nucleotide sequence encoding 
said holo ACP synthase and the nucleotide sequence encoding at least a portion of 

2 0 said minimal PKS arc fiiscd so as to encode a fusion protein. 

5. The modified cell of claim 1 wherein said expression system for said 
minimal PKS and said expression system for said holo ACP synthase are present on 
separate vectors. 

25 



6. The modified cell of claim I wherein at least one of said expression 
systems is integrated into the host cell chromosome. 
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7 A method to produce a polyketide which method comprises culturing 
the cells of claim 1 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized 

5 8 A recombinant host cell modified to cxjntain either 

a) at least two vectors, said first vector containing a first selectable 
marker and a first expression system and said second vector containing a second 
selectable marker and a second expression system and optionally additional vectors 
containing additional selcaable markers and expression systems wherein said 

1 0 expression systems contained on said vectors are effective to produce at least a 
minimal polyketide synthase (PKS); or 

b) at least one vector and a modified chromosome, said one vector 
containing a first selectable marker and a first expression system and said modified 
chromosome containing a second expression system and optionally additional vectors 

1 5 containing additional selectable markers and expression systems wherein said 

expression systems contained on said vectors in combination with said expression 
system on said chromosome are effective to produce at least a minimal PKS; 

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
2 0 protein (ACP) activity for an aromatic PKS; or 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
and an ACP activity for a modular PKS. 

9. The cell of claim 8 which is a yeast cell, an £. call cell, an 
2 S actinomycete cell or a plant cell. 

10 The cell of claim 8 which further contains an expression system for a 
cell-based detection system for a functional polyketide 
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1 1 The cell of claim 8 which produces at least a minimal aromatic PKS 
and which contains: 

(a) a first vector comprising a first selectable marker and an expression 
system comprising a nucleotide sequence encoding a KS/AT catalytic region operably 

5 linked to a promoter operable in said cell; 

(b) a second vector comprising a second selectable marker and an 
expression system comprising a nucleotide sequence encoding a CLF catalytic region 
operably linked to a promoter operable in said cell; and 

(c) a third vector containing a third selectable marker and an expression 
1 0 system which comprises a nucleotide sequence encoding an ACP activity operably 

linked to a promoter operable in said cell. 

12. The cell of claim 8 which produces at least a minimal modular PKS 
and which contains 

1 ^ (a) a first vector containing a first selectable marker and an expression 

system for at least one module of a polyketidc synthase (PKS) operably linked to a 
promoter operable in said cell; and 

(b) a second vector containing a second selectable marker and a nucleotide 
sequence encoding at least a second module of a polyketide synthase operably linked 

2 0 to a promoter operable in said cell. 

13. The cell of claim 12 wherein said first and second module arc derived 
from different polyketide synthases. 

2 5 14. The ceil of claim 13 wherein said nucleotide sequence encoding at 

least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity; or 
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wherein said nucleotide sequence encoding at least one module encodes a KR, 
DH and ER activity, and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 

S 

15. A method to produce a potyketide which method comprises cultunng 
the cells of claim 8 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 

10 16. The cell of claim 8 which is further modified to contain a recombinant 

expression system for a holo ACP synthase. 

17 A method to produce a polyketide which method comprises culturing 
the cells of claim 16 under conditions wherein said expression systems produce the 

1 5 encoded proteins and wherein said polyketide is synthesized. 

18 A library of polyketide synthases PKS or synthesized polyketides 
which comprises a panel of individual colonies, each colony containing either 

a) at least two vectors; said first vector containing a first selectable 

2 0 marker and a first expression system and said second vector containing a second 

selectable marker and a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors are efiFective to produce at least a 
minimal polyketide synthase (PKS), or 

25 b) at least one vector and a modified chromosome, said one vector 

containing a first selectable marker and a first expression system and said modified 
chromosome containing a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors in combination with said expression 

3 0 system on said chromosome arc effective to produce at least a minimal PKS, 
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said minimal PKS comprising a ketosynthase/acyl transferaseL(KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
protein (ACP) activity for an aromatic PKS; and 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
5 and an ACP activity for a modular PKS 

wherein the combination of vectors or of vcctor(s) and modified chromosome 
is different in each colony 

19. The library of claim 18 wherein said colonies are colonies of yeast, 
10 E, coli, actinomycetes or plant cells. 

20. The library of claim 18 wherein each colony further contains an 
expression system for a cell-based detection system for a functional polyketide. 

1 ^ 21 The library of claim 1 8 wherein the PKS are aromatic PKS and each 

colony contains: 

(a) a first vector comprising a first seleaable marker and an expression 
system comprising a nucleotide sequence encoding a KS/AT catalytic region operably 
linked to a promoter operable in said cell; 

2 0 (b) a second vector comprising a second selectable marker and an 

expression system comprising a nucleotide sequence encoding a CLF catalytic 
domain operably linked to a promoter operable in said cell. 

(c) a third vector containing a third selectable marker and an expression 
system which comprises a nucleotide sequence encoding an ACP activity operably 
2 5 linked to a promoter operable in said cell; 

wherein said combination of first, second and third vectors is different in each 

colony. 
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22 The library of claim 1 8 wherein the PKS are modular PKS wherein 
each colony contains 

a first vector containing a first selectable marker and an expression for at least 
one module of a PKS operably linked to a promoter operable in said cell; and 

5 a second vector containing a second selectable marker and a nucleotide 

sequence encoding at least a second module of a polyketide synthase operably linked 
to a promoter operable in said cell; 

wherein said combination of first and second vectors is different in each 

colony. 

10 

23 The library of claim 22 wherein said nucleotide sequence encoding at 
least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity, or 

1 5 wherein said nucleotide sequence encoding at least one module encodes a KR, 

DH and ER activity; and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 
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24 The library of claim 18 wherein each colony further contains a 
recombinant expression system for a holo ACP synthase 



25. A method to produce a library of polyketides which method comprises 
culturing the cells of claim 18 under conditions wherein said expression systems 
2 5 produce the encoded proteins and wherein said polyketide is synthesized. 



26 A method to produce a library of polyketides which method comprises 
culturing the cells of claim 24 under conditions wherein said expression systems 
produce the encoded proteins and wherein said polyketide is synthesized. 
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27. A method to identify a polyketide that binds a target receptor which 
method comprises contacting said receptor with each member of the library of claim 
18 under conditions wherein binding to said receptor can be detected; and 

5 detecting the presence or absence of binding to said receptor with respect to 

each member, whereby 

a member that binds to a receptor is identified. 

28. A method to identify a polyketide that binds a target receptor which 

1 0 method comprises contacting said receptor with each member of the library of claim 
24 under conditions wherein binding to said receptor can be detected; and 

detecting the presence or absence of binding to said receptor with respect to 
each member, whereby 

a member that binds to a receptor is identified. 

15 

29 A method to identify a polyketide functional in a cell-based detection 
system which method comprises assessing each member of the library of claim 18 

for the presence or absence of signal in said cell-based detection system 

whereby a functional polyketide is identified. 

20 

30 A vector adapted for expression in yeast which vector contains a 
selectable marker operable in yeast, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
linked to a promoter operable in yeast. 

25 



31. 



A yeast cell modified to contain the vector of claim 30. 
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32. The yeast cell of claim 3 1 which further contains a recombinant 
expression system for a holo ACP synthase 

33. A method to produce a polyketide synthase activity which method 

5 comprises culturing the yeast cell of claim 3 1 under conditions wherein expression is 
favored. 

34. A method to produce a polyketide synthase activity which method 
comprises culturing the yeast cell of claim 32 under conditions wherein expression is 

10 favored. 

35 A vector adapted for expression in E. coli which vector contains a 
selectable marker operable in £. coU, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
1 linked to a promoter operable in K coli. 



2 0 expression system for a holo ACP synthase. 

38. A method to produce a polyketide synthase activity which method 
comprises culturing the £. coli cell of claim 36 under conditions wherein expression is 
favored. 

25 



36 



An £, coli cell modified to contain the vector of claim 35 



37. 



The E. coli cell of claim 36 which further contains a recombinant 



39 A method to produce a polyketide synthase activity which method 
comprises culturing the E, coli cell of claim 37 under conditions wherein expression is 
favored. 
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